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Multi-Projector Mosaic with Automatic Registration 
Field of the Invention 

This invention relates generally to optical devices which project images on planar 
surface, and more particularly to projecting an arbitrary number of images to form 
5 a single mosaic displayed image. 

Background of the Invention 

3 A photo-mosaic is a 2D array of images registered together to form one large 

Jt) preceived image. The images are either taken from the same viewpoint in different 

J directions or taken from different viewpoints of a planar scene, see Szeliski et al. 

I S "Creating full view panoramic image mosaics and texture-mapped models," 

jU Computer Graphics (SIGGRAPH'97), pp. 251-258, 1997, Chen et al. "Quicklime 

j 7 VR - an image-based approach to virtual environment navigation," Comp. Graph. 

;|5 Proc, Annual Conf. Series (Siggraph 95), pp.29-38, 1995. and U.S. Patent 

5,986,668 "Deghosting method and apparatus for construction of image mosaics" 

issued to Szeliski, et al. on Nov. 16, 1999. 

Large projectors arrays are popular because they offer a practical solution to the 
20 problem of generating a high-resolution and bright image. Older systems, such as 
video walls, typically use an array of rear-projectors with abutting image edges 
separated by a small but always visible gap or overlap. Newer systems use 
overlapping projectors with facility for precise manual electro-mechanical 
adjustment for image registration and blending. The setting up of these displays is 
25 still quite tedious, requiring precise projector overlap, and often a near orthogonal 
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projection to the display surface, i.e., the display surface has to be perpendicular to 
the optical axis of each projector. This arguably is the most prominent drawback of 
large format display design. 

5 A projector is similar to a camera in that the image projection process can be 
expressed using the well-known pinhole camera model. Thus far, however, 
projectors have received little attention in the field of computer vision. 

Raskar et al. in "Multi-projector displays using camera-based registration/' IEEE 
10 Visualization, pp. 161-68, 1999 described various approaches to building a multi- 
25 projector display system. They provided a general solution to the seamless display 
y problem using stereo cameras to determine the display surface and individual 
% projector's intrinsic and extrinsic parameters all in a common coordinate frame. 
| S The result was an exhaustive description of the entire display environment. 

Although that approach allowed for a general solution, the computational effort 
i*f and resources needed to implement that approach introduced their own level of 
•il complexity. 

Chen et al. in "Automatic alignment of high-resolution multi- projector displays 
20 using an UN-calibrated camera, " IEEE Visualization 2000, 2000, provided a 
mechanism to reduce the problem of mechanical alignment using a camera with 
controllable zoom and focus, mounted on a pan-tilt unit. The data collection and 
computation took over thirty minutes. Surety in "Scalable Self-Calibration Display 
Technology for Seamless large-scale Displays," pH Thesis, Massachusetts Institute 
25 of Technology, 1999 presented a solution that also used a camera to establish the 
relative geometry of multiple projectors. The camera was calibrated by imaging a 
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piece of paper printed with a regularly spaced grid placed in front of the display 
surface. Subsequently, projector imagery was registered to the grid. 

It is desired to significantly reduce the support and cost for projector arrays. It is 
5 also desired to provide a flexible image registration and rendering technique that 
can adapt to a given projector array configuration. Furthermore, it is desired to 
fully automate and reduce the time required for setting up and registering multiple 
projectors. 

10 Summary of the Invention 

~ui The present invention provides a multi-projector display system having an 

5 arbitrary number of projectors. An automated method is provided for registering 

\~ the projector array to produce a seamless single rectangular displayed image. A 

15 single camera is used to determine the pose of the projector from the relative 

!sr,isv 

ff location and orientation of each projected image. The poses of the projectors are 
j£J used to register the projectors with respect to the planar display surface to achieve 
N registration and intensity blending of the images. During operation, source images 
are warped so that they appear correctly as a single image when projected onto the 
20 planar display surface. The warped images are then intensity corrected to account 
for overlap. The corrected images can then be projected. As an advantage, the 
display surface can be oblique to the optical axes of the projectors. 

More specifically, the invention provides a method and system for forming a 
25 mosaic image on a display surface with a multiple projectors. For each projector in 
turn, a registration image is projected onto the display surface so that a union of the 
projected registration images forms a polygon. 
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With a camera, for each registration image in turn, a corresponding input image is 
acquired. A display area on the display surface enclosed by the polygon is then 
identified, and a single projective matrix between the display area and each input 
5 image is determined for each projector. 

A source image for each projector is warped according to the corresponding single 
projective matrix of the projector. The pixels of the warped source image are 
weighted according to the projective matrix, and then the warped and weighted 

10 source images are concurretnly projected directly onto the display surface to form 

S the mosaic image. 

O Brief Description of the Drawings 

15 Figure 1 is a diagram of a multi-projector display system according to the 
CS invention; 

P Figure 2 is a flow diagram of a pre-processing phase of the display system of 
Figure 1; and 

20 

Figure 3 is a flow diagram of a rendering phase of the display system of figure 1 . 
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Detailed Description of the Preferred Embodiment 
Definitions 

5 In the description below, the following computer manipulated images are 
specifically distinguished and defined. 

Registration image: an image having a predetermined structure and geometry 
projected onto a planar display surface. Multiple registration images are projected 
10 sequentially, one for each projector in an array of projectors. 



:7! Camera image: an image acquired by a camera of the display surface while the 

5 registration image is projected. One camera image is acquired for each registration 

! -J image projected by one of the projectors. 

%5 

fj Source image: an image that is to be projected onto the display surface to form a 

W mosaic. Multiple source images are concurrently generated, one for each projector 

N= in an array of projectors. 
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Projected image: the source image after warping and blending. 

Displayed Image: the user perceived appearance of multiple projected images on 
5 the display surface. 

Display Area: a region of the display surface where the displayed image appears. 

Multi-Projector System Overview 

10 

J Figure 1 shows a self-correcting multi-projector display system 100 according to 

our invention. The system 100 includes an array of projectors 101-104, and a 
'% camera 110 coupled to a processor 120. For example, we use four Mitsubishi 
; 5f Electric Corporation X-80 projectors having a resolution of 1024x768 pixels, and a 
15 single Logitech Quickcam Pro USB camera with a resolution of 640x480 pixels. 
pB The system 100 can be used in front or rear projection mode. 

H The projectors 101-104 can be arbitrary in number and are installed casually aimed 
at a planar display surface 130 so that projected images 131-134 approximately 

20 overlap. Note, it is not necessary for the images 131-134 to be aligned, or to 

completely overlap. Indeed, the projected images 131-134 can be key-stoned, and 
rotated or skewed with respect to each other, and gaps may exist between images. 
Furthermore, the optical axes of the projectors can be oblique to the display surface 
and to each other. 

25 

Here, the projectors are stacked in a 2x2 array, however other configurations are 
possible, For example, all the projector can be stacked vertically or aligned 
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horizontally side-by-side. The only requirement is that the projectors generally 
point at the display surface 130 and generally have overlapping images in some 
display area 135. Similarly, the camera 110 is aimed so that its field of view covers 
at least the union of the images 131-134. Note, the camera is only used during a 
5 pre-processing phase 200, described in greater detail below. In the preferred 
embodiment, the projectors and camera are digital devices. 

The processor 120 includes input/output (I/O) devices 121 for user interaction. 

These are standard. The processor also includes a memory storing executable 
10 programs that implement the method that control the system 100 during operation. 
S The method that operates the system includes the pre-processing phase 200 and a 
fl rendering phase 300. These will be described in further detail below. With our self- 
9 correcting array of projectors is possible to generate a mosaic display that appears 
fiJ as a single displayed image of known shape, for example, a rectangle with a 
%5 specific aspect ratio, even when the individual projectors of the array are arbitrarily 
SQ aimed at an oblique planar surface. 

P Registration and Mosaic Method 

20 Rather than carry out a full registration for the projector-camera system 100, such 
as determining the intrinsic parameters for both types of devices plus their relative 
pose, our multi-projector system 100 only uses a single projective matrix to warp a 
source images for each projector. The single projective matrix is a combination of 
a camera to projector homography, and a display area to camera homography 

25 predetermined as decribed below. 
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Pre-Processing 

Figure 2 shows the steps performed during pre-processing 200. The registration 
pattern, e.g. a checkerboard 140 is projected 210 on the display surface for each of 
5 the projectors 101-104 in turn. The correspondences (C) 221 of registration pattern 
features, e.g., corners or lines, are extracted from the input images 111. We use 
forty-eight correspondences to achieve sub-pixel registration accuracy. The 
correspondences 221 are used to determine a homography (H ci ) 231 between the 
camera and each projector L 

10 

Ji After the homographies for all projectors are known, the dimensions (D) 241 of the 
h] usable display area 135 is determined 240. The dimensions 241 are of the largest 
2 rectangle (or any other predefined shape) that is enclosed by the union of all of the 
5J projected images 131-134. It should be noted, that the display area can have other 
%5 predetermined shapes regular or irregular, for example, circular, triangular, oval, 
U diamond, face, etc. The shape can be defined parametrically or by a shape mask. 

^ After the dimensions of the usable display area 135 are known, the portions of 
projected images that are outside the display area and that overlap other projected 

20 images can be identified. This allows us to determine 250 a homography (H rc ) 251 
between the display coordinates and the camera coordinates, as well as intensity 
weights (W) 261 that need to be applied during the rendering phase. 

Pixels that are outside the display area have a weight of zero (black), and pixels in 
25 overlapping portions have a weight in the range 0 < W < 1, all other pixels have a 
weight of one. During blending, described below, each pixel is multiplied by its 
corresponding weight. 
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Rendering 

During rendering 300 as shown in Figure 3, source images (I) 301 to be projected 
5 are first warped 310 according to the single projective matrix Bj 302. Then, the 
warped image / 31 1 is intensity weighted during blending 320 to / 321, and 
projected 330 onto the display area 135. The intensity weights are on a per pixel 
basis. For example, the weights are stored in alpha-maps as are used by a 
convnetional rendering engine. 

10 

Homographies 

9 We describe how the two homographies 23 1 and 25 1 are used to define a single 
H projective matrix Bi 302 used during rendering. We assume a pin-hole camera 
J.5 model. If two cameras acquire two images of a 3D plane n, then the two images 
W are related by a homography H, defined up to scale (~=). If mi and m 2 are 
HI projections of a 3D location M in the plane n, then m2 ~= Hmi , where ml and m2 
M are homogeneous coordinates, and ~= means equality up to scale. 

20 In our multi-projector system 100, we desire to seamlessly stitch N source images 
from multiple projectors P i5 for i = 1,. . N, into a single displayed image. In order 
to align the projectors with each other, we use a single camera 1 10 to acquire all of 
the N input images of the respective registration images. The projector to camera 
mapping, as well as a relative projector to projector mapping are then described by 

25 the above homographies due to the use of the planar display surface 130. 
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For notation, we choose homogeneous coordinates for both 2D camera coordinates 
x c = (x 9 y, l) r , and for 2D projector coordinates Ui = (u, v, if, for i = 1,.., N, from 
multiple projectors. In such a context, the N projected images map to the single 
input image 131 with the known homographies, H cb H c2 , H cN , satisfying Uj ~= 
5 H ci x c , fori = l,...,N (1) 
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In the displayed image, the projected images 131-134 are clipped to a known 
shape. The coordinates of the displayed image inside this shape, which are 
equivalent to the coordinates of the projected images, are denoted by x r =(x 9 y, 1) . 

The relationship between the display coordinates and the camera coordinates can 
be described by a second 2D projective matrix H rc . Therefore, we have 

uj ~= H ci x c ~= (H ci H rc )x r , f or / = 1 , . . . , N. (2) 



%!5 For simplicity of illustration, we define a set of homographies: 
P? H ri =H ci H rc , for/=l,...,N, 

W] which specify directly the geometrical relationship between each projector and the 
M= displayed image. Then, the pixel mapping between two arbitrary projected images 
is 

20 uj ~= HgHrf-V (3) 

where Ui and Uj denote the corresponding pixels in the source images of projector 
Pi and Pj, respectively. 



Registration 

25 

To generate a seamless displayed image, we ensure that the projected images are 
registered with each other, and that the intensities across overlap region appear to 
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transition smoothly. We first describe the technique used to register the multiple 
projectors 101-104 with the single camera 1 10, and then we describe the smooth 
intensity blending 320. 

5 In this subsection, we describe a process for determining the relationship between 
projected pixels. This relationship is implicitly used during rendering 300 to ensure 
that the projected images are properly registered by warping the source images 
301 . For a projected mosaic, we do not explicitly calibrate either camera or the 
projectors, as in the prior art, in order to seamlessly stitch the projected images. 
10 Instead, the homographies between multiple projectors and the static camera are 
Ji sufficient to correctly align the multiple projected images. In other words, we do 
y need to a calibration of the optics to determine intrinsic and extrinsic parameters, 
J as in the prior art. Instead, we rely only on the variuous homographies. 

;U> To register the projectors 101-104 with the cameral 1 10, we first project 210 the 
ff registration pattern 140 onto the display surface 130 with each of the projectors 
[£j 10 1 - 1 04 in turn, and acquire the corresponding input images 1 1 1 . By extracting the 
^ feature from the 2D input images 111 corresponding to known 2D features in the 

registration pattern, we can determine the homography between the camera and 
20 each projector based on ( 1 ) . 

More formally, the homography 231 between the camera 110 and a given projector 
can be formulated as maximum likelihood estimation problem. Given n 
corresponding features between the input image x, and the registration image u, the 
25 maximum likelihood estimate can be obtained by minimizing a cost function 

Ik-H^, (4) 
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where H is the homography from the camera to the projector, up to a scale. With 
four or more correspondences between the camera image and the registration 
image, the eight unknown parameters of H are determined using a least-squares 
method. More sophisticated techniques can improve robustness to noise, or 
5 outliers. In any case, the registration is done automatically. Furthermore, as 
described above, the relative mapping between any two projectors can be 
determined by combining Equation (3) and (4). Note, again an explicit calibration 
is not needed. 

10 In order to form the displayed image 135, we find the maximum size shape, e.g., 

rectangle, that is enclosed by the union of images that would concurrently be 
0 projected by all of the projectors. This problem can be formulated as a constrained 
2 linear optimization problem. 

15 The illuminated quadrilaterals 13 1- 1 34 are specified in the coordinate system of 
the camera 1 10. We want to find, e.g., a largest axis aligned rectangle of a given 
aspect ratio enclosed by the union of N quadrilaterals. The union is a polygon L 
136 marked in Figure 1 with heavy edges. We assume that at least one vertex of 
the largest rectangle lies on an edge of the polygon L. Because, the aspect ratio is 
20 fixed, the rectangle has three degrees of freedom, position of one of the vertex in 
2D, and scale of the rectangle. Therefore, we discrete the edges of the polygon L 
136, and solve for the largest rectangle by testing for each possible position of the 
rectangle vertex resting on edges of L and each possible scale. The test checks 
whether the rectangle edges intersect the edges of the polygon L. After we find the 
25 near optimal inscribed rectangle, we update the homography from the display 
coordinate system to projector pixels using Equation (2). 
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It is also possible to build a scalable multi-projector system 100 by adding more 
projectors, even if the single camera 1 10 cannot view all the projected images. The 
only requirement is to define the relationship between any one projector and 
corresponding coordinates of the displayed image 135. In the preferred 
5 embodiment, the inscribed rectangle in the viewing camera defines the 
relationship. But, any mapping between one or more cameras and display 
coordinates in an Euclidean framework can be used. Hence, even if a single camera 
does not view all the projections, it is easy to propagate the relationship (H^H^), 
between neighboring projectors i and j, as long as any one single camera can view 
1 0 both the projections . 

Rendering 

!j* The displayed image 135 are rendered 300 as a scaled and seamless version of 
45 multiple projected images 321 even under oblique projection due to individual 
CO casually installed projectors. As shown in Figure 3, we first warp each source 
IH image 301 to be projected according to the projective matrix 302 so that when it is 
H projected onto the display surface 130 all of the warped images appear aligned 

inside the display area 135. Each warped image 31 1 is then blended by the weights 
20 361 to produce the projected image 321 , which is fed to one of the projectors 101- 

104. 

Warping using Homography 

25 The warping required for the projected image is defined by the projective matrix 
that transformas pixel coordinates of the projected images to pixel coordinates of 
the desired displayed image. As described above, th projective matrix 302 is is 
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determined, during the pre-processing. At each projector, the rendering process 
independently warps each source image 301. 

The source image 301 is loaded in a texture memory of the projector. We then 
texture map a unit rectangle in the x-y plane, i.e., with extents [0:1, 0:1]. 
Because x, ~= H ri u r , for projector i, we use the single projective matrix 

Bj =H ri [l I 0], (5) 
where [110] converts a 3x3 matrix to a 4x4 matrix by an identity matrix padded 
out with zero vectors, see equation 6 below. 

To be precise, we use an orthographic projection matrix, which is multiplied by 
H ri . The pixels in regions outside display coordinates [0:1, 0:1] also appear in the 
projector frame buffer. This corresponds to the region outside the display area but 
enclosed by the polygon L 136, i.e., the union of N illuminated quadrilaterals. 



We keep the background color (zero weight) black to avoid contributing to this 
region. The projective matrix is converted into a 4x4 matrix as used in a 
conventional graphics pipeline by simply adding a row and a column for depth 
buffer calculations. The resultant depth values are, of course, ignored, e.g. 



[110] = 



1000 
0100 
0010 
0000 



(6) 
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Image Intensity Blending 

Regions of the display area 135 that are illuminated by multiple projectors would 
normally appear noticeably brighter. To make the overlap regions appear seamless, 
5 i.e., having smooth intensities, we use an intensity blending process. We generate a 
mask for each projector, which assign an intensity weight in the range 0.0 to 1.0 
for every pixel in the projected images. 

During the blending, we load the per-pixel intensity weight into an alpha channel 
10 of a second texture map. However, the colors channels are black. The same unit 
S rectangle is rendered with this texture map modifying the intensity weights. The 
S alpha channel acts a transparency function, multiplying the underlying intensities 
1 of the warped image 311. Thus, both operations, warping using the projective 
pJ matrix and the intensity correction are achieved efficiently using 3D graphics 
15 hardware. 

U1 It should be understood that multiple projectors can overlap at the same 

C illuminated point. For example, at the center of the display area 135 of Figure 1 , 

four projectors may well overlap. However, for uniform intensity, the weights of 
20 all projected pixels illuminating the same display surface point are made to add up 

to one. 

A simple solution would illuminate a given point with only a single projector. 
Alternatively, each projector could contribute an equal amount, e.g., 0.5 for two 
25 projectors, or 0.25 for four. In practice, due to small errors in projector registration, 
perspective distortions, and non-linear barrel distortions, the projected images will 
not match exactly at their respective edges. In addition, over time, electro- 

15 



CR-1347 
Raskar et al. 

mechanical vibrations disturb the positions of projectors. For any of these 
problems, even a misalignment of single pixel would cause an annoying line in the 
displayed image. Hence, there is a need to achieve a smooth transition of weights 
in the overlap. The intensities in the resulting superimposed images have to be 
5 reduced sensitivity to the static registration and dynamic registration error. 

Our blending algorithm uses a strategy where the weights assigned to pixels near 
the edges is near zero. The weight for pixels in non-overlap regions is clearly one, 
and neighboring pixels in the overlap region are assigned weights as described 
10 below. 

ft More specifically, to find a weight A m (u) associated with a pixel u =(u, v, 1), we 
Q apply a blending technique, i.e., we weigh the pixels in the warped source image 
RJ 311 proportional to its distance to an edge of the image. More precisely, the pixels 
15 are weighted proportional to their distance to a nearest "invisible" pixel, invisible 
5 being a pixel having zero weight. The projective matrices are computed with 
5 normalized projector coordinates so that the u and v coordinates vary between 
H [0, 1]. Hence, the distance of a pixel to the closest edge in the projector Pi is 
described by 

20 dj(u) = w(u, v)min(u, v, 1- u, 1- v) (5) 

where, w(u, v) = 1, if u e [0,1], and v e [0,1],=0 otherwise. 

This reduces the weights assignment problem, to a min function. Further, based on 
the implicit pixel correspondences across the multiple projectors, we ensure that 
25 the weight of pixels illuminating the same display surface adds up to one. The 
pixel weight A m (u) associated with pixel u of projector P m is evaluated as follows: 
A m (u) = d^uVCSidiCHriHnn^u)), for i = 1...N (6) 
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This type of intensity blending is more stable in presence of small mis-registration 
errors, whether the error results in a gap between the projected images or extra 
overlap. As explained below, the weights are implemented using the commonly 
5 used alpha-maps for transparency in 3D graphics hardware. Last we project 330 
the warped and intensity weighted image 321 in each projector to produce the 
displayed image. 

Effect of the Invention 

10 

0 The invention is effective for quickly setting up a large number of digital projectors 
J so that the combined projections appear as a single cohesive image without 

| noticeable overlap in the images. The projectors are coupled to a processor. The user 

m merely aims the projector in the general direction of a display surface so that 

jf$ individual projected images overlap or nearly overlap in some predetermined form, 

1 for example a 2x3 matrix for six projectors. Once, the projectors are approximately 
|t positions, a registration image is displayed by each projector to register the 

F? projectors with each other and the display surface. As soon as the projectors are 
registered they can be used to display a seamless mosaic of images. The whole 
20 process takes a couple of seconds. 

This invention is described using specific terms and examples. It is to be 
understood that various other adaptations and modifications may be made within 
the spirit and scope of the invention. Therefore, it is the object of the appended 
25 claims to cover all such variations and modifications as come within the true spirit 
and scope of the invention. 
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